Accelerating Deformable Convolution Networks with Dynamic and Irregular Memory Accesses
نویسندگان
چکیده
Deformable convolution networks (DCNs) proposed to address the image recognition with geometric or photometric variations typically involve deformable that convolves on arbitrary locations of input features. The change different inputs and induce considerable dynamic irregular memory accesses which cannot be handled by classic neural network accelerators (NNAs). Moreover, bilinear interpolation (BLI) operation is required obtain deformed features in DCNs also deployed existing NNAs directly. Although a general purposed processor (GPP) seated along can process convolution, processing GPP extremely slow due limited parallel computing capability massive additional data movement. To problem, we develop DCN accelerator support both standard convolution. Specifically, for DCNs, have output divided into tiles build tile dependency table (TDT) track at runtime. With TDT, further an on-chip scheduler handle efficiently. In addition, propose novel mapping strategy enable BLI apply layer fusion techniques more energy-efficient processing. According our experiments, achieves orders magnitude higher performance energy efficiency compared typical architectures including ARM, ARM+TPU, GPU 6.6% chip area penalty NNA.
منابع مشابه
Accelerating Analytics with Dynamic In-Memory Expressions
Oracle Database In-Memory (DBIM) accelerates analytic workload performance by orders of magnitude through an inmemory columnar format utilizing techniques such as SIMD vector processing, in-memory storage indexes, and optimized predicate evaluation and aggregation. With Oracle Database 12.2, Database In-Memory is further enhanced to accelerate analytic processing through a novel lightweight mec...
متن کاملDeformable texture : the irregular-regular-irregular cycle
Departures from a regular texture pattern can happen in many di erent dimensions. Previous related work has focused on faithful texture synthesis for near-regular texture departing along the color and intensity axes while the underlying geometric regularity is well preserved. In this paper, we address the issue of faithful texture synthesis for textures that have both the structural and color/i...
متن کاملFormalizing Memory Accesses and Interrupts
The hardware/software boundary in modern heterogeneous multicore computers is increasingly complex, and diverse across different platforms. A single memory access by a core or DMA engine traverses multiple hardware translation and caching steps, and the destination memory cell or register often appears at different physical addresses for different cores. Interrupts pass through a complex topolo...
متن کاملA Discrete Singular Convolution Method for the Seepage Analysis in Porous Media with Irregular Geometry
A novel discrete singular convolution (DSC) formulation is presented for the seepage analysis in irregular geometric porous media. The DSC is a new promising numerical approach which has been recently applied to solve several engineering problems. For a medium with regular geometry, realizing of the DSC for the seepage analysis is straight forward. But DSC implementation for a medium with ir...
متن کاملA Compilation Framework for Irregular Memory Accesses on the Cell Broadband Engine
A class of scientific problems represents a physical system in the form of sparse and irregular kernels. Parallelizing scientific applications that comprise of sparse data structures on the Cell Broadband Engine (Cell BE) is a challenging problem as the memory access pattern is irregular and cannot be determined at compile time. In this paper we present a compiler framework for the Cell BE that...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: ACM Transactions on Design Automation of Electronic Systems
سال: 2023
ISSN: ['1084-4309', '1557-7309']
DOI: https://doi.org/10.1145/3597431